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(54) Web data acquisition apparatus and method, and storage medium storing program for this 
method 



(57) It is an object to provide a Web data acquisition 
apparatus and method which can automatically and col- 
lectively acquire contents of URLs matching a query 
condition without any manual operation in a search sys- 
tem on the Internet, and a storage medium storing a 
program for the method. Even if URLs matching a query 
condition extend in a plurality of pages at a search site 
on the Internet, information is automatically acquired 
upon detection of a next page, thus automatically and 
collectively acquiring the contents of the URLs matching 
the query condition without any manual operation. In 
addition, a display condition is input, and only informa- 
tion of designated fields is displayed as a search result. 
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Description 

BACKGROUND OF THE INVENTION 

[Field of the Invention] s 

[0001] The present invention relates to an appara- 
tus and method for collectively acquiring Web data 
existing on the Internet or the like, and a storage 
medium storing a program for the method. 10 

[Description of the Related Art] 

[0002] Conventionally, various programs for collec- 
tively acquiring various kinds of information on home is 
pages on the Internet (to be referred to as Web data 
hereinafter) have been implemented. For example, a 
program is available, which is designed to access home 
pages from the URL (Uniform Resource Locator) of a 
given home page up to a nesting level upon designation 20 
of the URL, the nesting level, and the like, and acquire 
URL information existing in the home pages. After the 
URL information is acquired, a user selects offline, from 
the acquired URL information list, specific URLs indicat- 
ing specific pages to be downloaded. When the user 25 
logs in to the Internet again, the pages of the selected 
URLs are collectively downloaded. 
[0003] According to a known program, when the 
user designates the URL of a given home page and a 
nesting level, all home pages from the URL to the nest- 30 
ing level are downloaded. 

[0004] Search pages for searching for various kinds 
of information exist in the Internet. When the user inputs 
a query condition such as a keyword on a search page, 
a list of URLs of home pages matching the query condi- 35 
tion is displayed. If many URLs are retrieved, a list is dis- 
played in a plurality of pages. When the user designates 
(clicks) a desired URL in the displayed list, the corre- 
sponding page is displayed on the browser. For exam- 
ple, the following function is provided by the home page 40 
of the US Patent and Trademark Office. When the user 
inputs a query condition on this home page, patents 
with contents matching the query condition are dis- 
played as a list of URLs. When the user designates a 
desired URL (patent) from the list, the contents of the 45 
patent are displayed. 

[0005] With the above search, however, only a list of 
URLs can be obtained as a search result, but the con- 
tents of the home pages indicated by the URLs are not 
automatically downloaded. When, therefore, the user so 
wants to acquire the contents of the home pages indi- 
cated by the retrieved URLs, the user must click the 
URLs in the retrieved URL list one by one to acquire the 
contents. This is cumbersome operation. 
[0006] With the use of the above Web data collec- ss 
tive acquisition program, Web data can be automatically 
acquired by tracing all URLs (or URLs selected by the 
user) existing in a home page. If, however, there are 
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many URLs as a search result, a list of URLs is dis- 
played in a plurality of pages. Since the conventional 
Web data collective acquisition program has no page 
turning function, the user must manually designate a 
range of Web data to be downloaded. 

SUMMARY OF THE INVENTION 

[0007] It is an object of the present invention to pro- 
vide a Web data acquisition apparatus and method 
which can automatically and collectively acquire the 
contents of URLs matching a query condition without 
any manual operation in a search system on the Inter- 
net, and a storage medium storing a program for the 
method. 

[0008] In order to achieve the above object, accord- 
ing to the present invention, there is provided a Web 
data acquisition apparatus for acquiring Web data of a 
page for displaying or retrieving information matching a 
query condition upon inputting the query condition on 
the Internet, an intranet, or an extranet, comprising 
means for designating a page as a search target, 
means for inputting a query condition, means for gener- 
ating a request for giving an instruction to execute a 
search under the query condition, and transmitting the 
request to a server as a search target, means for receiv- 
ing a response as a search result, storage means for 
storing contents of the response, means for detecting 
presence/absence of a next page from the contents, 
and means for, if the next page is detected, performing 
operation of acquiring data of the next page by transmit- 
ting a request to acquire the next page and storing the 
data in the storage means, and repeating the operation 
as long as a next page is detected. 
[0009] According to the present invention, there is 
provided a Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving informa- 
tion matching a query condition upon inputting the 
query condition on the Internet, an intranet, or an 
extranet, comprising means for designating a page as a 
search target, means for inputting a query condition, 
means for generating a request for giving an instruction 
to execute a search under the query condition, and 
transmitting the request to a server as a search target, 
means for receiving a response as a search result, stor- 
age means for downloading the linked page by tracing a 
link in the response, and storing the downloaded page 
data, means for detecting presence/absence of a next 
page from the response, and means for, if the next page 
is detected, performing operation of acquiring data of 
the next page by transmitting a request to acquire the 
next page, downloading the linked page by tracing a link 
in the acquired page data, and storing the downloaded 
page data in the storage means, and repeating the 
operation as long as a next page is detected. 
[0010] The apparatus may further comprise means 
for designating a nesting level at which information is 
acquired, and acquires information by tracing a URL 
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contained in the contents up to the nesting level. 
[0011] In the apparatus according to the present 
invention, a search is made for information stored in 
said storage means under another query condition, and 
the research result is stored in the storage means. s 
[0012] The apparatus may further comprise means 
for inputting a display condition for determining informa- 
tion of a specific field of retrieved information, and the 
information of the field designated under the display 
condition is extracted from the information stored in the 10 
storage means and is displayed. 
[0013] The apparatus may further comprise means 
for terminating information acquisition on the basis of a 
predetermined condition instead of continuing informa- 
tion acquisition as long as a next page is detected. 15 
[0014] According to the present invention, there is 
provided a Web data acquisition method of acquiring 
Web data of a page for displaying or retrieving informa- 
tion matching a query condition upon inputting the 
query condition on the Internet, an intranet, or an 20 
extranet, comprising the step of designating a page as a 
search target, the step of inputting a query condition, 
the step of generating a request for giving an instruction 
to execute a search under the query condition, and 
transmitting the request to a server as a search target, 25 
the step of receiving a response as a search result, the 
storage step of storing contents of the response, the 
step of detecting presence/absence of a next page from 
the contents, and the step of, if the next page is 
detected, performing operation of acquiring data of the 30 
next page by transmitting a request to acquire the next 
page and storing the data in the storage step, and 
repeating the operation as long as a next page is 
detected. 

[0015] According to the present invention, there is 35 
provided a Web data acquisition method of acquiring 
Web data of a page for displaying or retrieving informa- 
tion matching a query condition upon inputting the 
query condition on the Internet, an intranet, or an 
extranet, comprising the step of designating a page as a aq 
search target, the step of inputting a query condition, 
the step of generating a request for giving an instruction 
to execute a search under the query condition, and 
transmitting the request to a server as a search target, 
the step of receiving a response as a search result, the 45 
storage step of downloading the linked page by tracing 
a link in the response, and storing the downloaded page 
data, the step of detecting presence/absence of a next 
page from the response, and the step of, if the next 
page is detected, performing operation of acquiring data so 
of the next page by transmitting a request to acquire the 
next page, downloading the linked page by tracing a link 
in the acquired page data, and storing the downloaded 
page data in the storage step, and repeating the opera- 
tion as long as a next page is detected. 55 
[0016] The method may further comprise the step 
of designating a nesting level at which information is 
acquired, and acquires information by tracing a URL 



contained in the contents up to the nesting level. 
[0017] The method may further comprise making a 
search for information stored in the storage step under 
another query condition, and storing the research result 
as a file. 

[0018] The method may further comprise the step 
of inputting a display condition for determining informa- 
tion of a specific field of retrieved information, and the 
information of the field designated under the display 
condition is extracted from the information stored as the 
file and is displayed. 

[0019] The method may further comprise terminat- 
ing information acquisition on the basis of a predeter- 
mined condition instead of continuing information 
acquisition as long as a next page is detected. 
[0020] According to the present invention, there is 
provided a program storage medium storing a program 
for the Web data acquisition method defined in any one 
of claims 9 to 14. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0021] 

Fig. 1 is a block diagram showing the overall 

arrangement of a search system to which a Web 

data acquisition apparatus and method of the 

present invention are applied; 

Fig. 2 is a view showing an example of an initial 

search window on a client; 

Fig. 3 is a view showing an example of a search 

result display; 

Rg. 4 is a flow chart showing a search procedure; 
Figs. 5A to 5C are views each showing an example 
of the arrangement of a target URL table; 
Rg. 6 is a flow chart showing a procedure for out- 
putting retrieved information; 
Rg. 7 is a view showing an example of a window of 
the search page of the US Patent and Trademark 
Office; 

Rg. 8 is a view showing HTML sources (first) of the 
search page in Fig. 7; 

Rg. 9 is a view showing HTML sources (second) of 
the search page in Rg. 7; 

Rg. 10 is a view showing HTML sources (third) of 
the search page in Rg. 7; 

Rg. 1 1 is a view showing an example of a window 

of the search page of the Japanese Patent Office; 

Fig. 12 is a view showing HTML sources (first) of 

the search page in Rg. 1 1 ; 

Rg. 13 is a view showing HTML sources (second) 

of the search page in Fig. 1 1 ; 

Rg. 14 is a view showing HTML sources (third) of 

the search page in Rg. 1 1 ; 

Rg. 15 is a view showing HTML sources (fourth) of 

the search page in Rg. 1 1 ; 

Rg. 1 6 is a view showing an example of an initial 

search window in the second embodiment of the 
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present invention; and 

Fig. 1 7 is a flow chart showing a DL subroutine. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 5 

[0022] Embodiments of the present invention will be 
described below with reference to the accompanying 
drawings. 

[0023] Fig. 1 shows the overall arrangement of a 10 
search system to which a Web data acquisition appara- 
tus and method according to the present invention are 
applied. Clients 101 and 102 and a server 103 as a 
search site are connected to the Internet 100. Each of 
the clients 1 01 and 1 02 uses a browser (in which a pro- 15 
gram according to the present invention is installed) to 
connect to the server 103 and make a search by input- 
ting a query condition. Although the Internet will be 
exemplified below, the present invention can also be 
applied to an intranet used within a company or building 20 
or an extranet used among a plurality of companies or 
buildings. 

[0024] Fig. 2 shows an example of an initial search 
window on a client. Reference numeral 201 denotes an 
area for designating a server as a search target. Several 25 
search targets are registered in advance (for example, 
Alta Vista, goo, and Yahoo!). A list of search targets is 
displayed by clicking an arrow button on the right side of 
the area 201. A user can select one target from the list. 
Referring to Fig. 2, the home page of the US Patent and 30 
Trademark Office is designated as a search target. 
[0025] Reference numeral 202 denotes an example 
of a query input window. Queries that can be input vary 
with the search targets selected by the search target 
selection area 201 . When, therefore, the search target 35 
is changed, the window 202 is switched to another win- 
dow. Referring to Fig. 2, when the home page of the US 
Patent and Trademark Office is designated as a search 
target, the window 202 for inputting a query that can be 
designated is displayed. On the window 202, reference 40 
numeral 221 denotes an input area for a keyword 1; 
222, an area for designating a field in which the keyword 
1 exists; 224, an input area for a keyword 2; 225, an 
area for designating a field in which the keyword 2 
exists; 223, an area for designating whether to search 45 
for the keywords 1 and 2 with the AND operator or OR 
operator; and 226, an area for designating a range of 
years to search. An input example in Fig. 2 designates 
a search for a patent containing "ABC cooperation* as 
the keyword 1 in "Applicant" field and "xxxx" as the key- so 
word 2 in any of all fields from patents in the range of 
years 1 998 to 1 999. 

[0026] Reference numeral 203 denotes a window 
for designating a field to be displayed as a search result. 
A list of the names of fields that can be displayed is dis- 55 
played in a left box 231 . The user can designate fields to 
be displayed by operating buttons 233 to 236. The des- 
ignated fields are displayed in a right box 232. The but- 



ton 233 is used to designate all the fields in the left box 
231 and give an instruction to display them in the right 
box 232. The button 234 is used to designate fields 
selected by the user in the left box 231 and give an 
instruction to display the designated fields in the right 
box 232. Note that the user can select each field by 
clicking a corresponding field name in the left box 231. 
Each selected field name is highlighted. The button 235 
is used to give an instruction to detete all fields as dis- 
play targets in the right box 232. The button 236 is used 
to give an instruction to delete, as display targets, fields 
selected by the user in the right box 232. 
[0027] When the user inputs the above query condi- 
tion and display condition and clicks a start button 204, 
a search is started. Reference numeral 205 denotes a 
cancel button for canceling a search. Assume that in 
addition to information to be input on the window in Fig. 
2, the user designates beforehand a nesting level that 
indicates the extent to which home pages nested in a 
home page as a search target serving as a parent are to 
be regarded as search targets. 
[0028] Fig. 3 shows a search result example dis- 
played after a search is conducted by the user upon 
clicking the start button 204 on the initial search window 
in Fig. 2. Since the field names designated as search 
targets in the box 232 in Fig. 2 are 'No.', ■Applicant", 
"Title", and "Abstract", the contents of these fields asso- 
ciated with the retrieved patent are displayed, as shown 
in Fig. 3. 

[0029] A characteristic feature of this embodiment 
is, in particular, that when the user inputs a query condi- 
tion and display condition as shown in Fig, 2 and starts 
a search, all pieces of information matching the query 
condition are automatically downloaded. 
[0030] With a conventional browser, for example, 
when the user inputs a query condition in a patent 
search in the US Patent and Trademark Office, he/she 
can obtain a list of patents matching the query condi- 
tion. In order to download the contents of each patent, 
the user must click the patents in the patent list one by 
one. That is, cumbersome operation is required. With 
the use of a conventional Web data collective acquisi- 
tion program, the contents of the respective patents can 
be downloaded by automatically tracing URLs from the 
window of a retrieved patent list. However, this program 
has no page turning function. If, therefore, a patent list 
extends in many pages, the user must turn each page 
and designate download operation on each page. 
[0031] In contrast to this, according to this embodi- 
ment, when the user inputs a query condition and dis- 
play condition, as shown in Fig. 2, and starts a search, 
alt pieces of .information matching the query condition 
are automatically downloaded. Even if a retrieved pat- 
ent list extends in many pages, all patents matching the 
query condition are downloaded by automatically trac- 
ing the pages. 

[0032] Fig. 4 is a flow chart showing a search pro- 
cedure in this embodiment. In step 401, a target URL 
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table storing a search result and the like is initialized. 
The target URL table will be described later. In step 402, 
an initial search window like the one shown in Fig. 2 is 
displayed. In step 403, input information is received on 
the initial window. Then the user clicks the start button 5 
204, search processing in step 404 and the subsequent 
steps is started. In step 404, request information to be 
transmitted to an HTTP server as a search target is gen- 
erated in accordance with the input information like that 
shown in Fig. 2. In step 405, the request is transmitted 10 
to the HTTP server. In step 406, a response (search 
result) from the HTTP server is received and stored in 
the buffer In step 407, the status of the response is 
checked by referring to its header information. If an error 
is detected, a message is displayed, and the processing 15 
is terminated in step 408. If no error is detected, infor- 
mation is registered in the target URL table on the basis 
of the HTTP contents of the response. 
[0033] The target URL table will be described 
below. Each of Figs. 5 A to 5C shows an example of the 20 
arrangement of the target URL table. Fig. 5A shows the 
target URL table whose area is ensured and initialized 
in step 401. A nesting level 501 is a field in which the 
nesting level of each retrieved home page is stored. A 
sequence number 502 is a field in which sequence 25 
numbers added to search results in sequential order are 
stored. A URL 503 is a field in which the URLs of home 
pages as search results are stored. A title 504 is a field 
in which the tiles of home pages as search results are 
stored. A presence/absence 505 is a field in which infor- 30 
mation indicating whether each home page as a search 
result actually exists or not is stored. A local file name 
506 is a field in which the names of local files in which 
the contents of home pages as search results are 
stored. 35 
[0034] Referring back to Fig. 4, the search result 
received in step 406 is, for example, the page data of a 
list of URLs matching the query condition. For example, 
in a patent search in the US Patent and Trademark 
Office, a patent list containing links to the respective 40 
patents (the URLs of home pages in which the contents 
of the patents are written are embedded in the display 
portions indicating the patents) is received as the page 
data of the search result. In step 409, a predetermined 
file name is added to the page data of the patent list, the 45 
resultant data is stored as a local file, and the file name 
is registered in the local file name 506 of the target URL 
table in Fig. 5 A. Since the received search result corre- 
sponds to the parent-level search, T is set in the nest- 
ing level 501. In addition, since this data is registered so 
first in the target URL table in Fig. 5A, "1" is set in the 
sequence number 502. The URL and title of the page of 
the search result are respectively set in the URL 503 
and title 504. Since the page data actually exists, "pres- 
ence" is set in the presence/absence 505. 55 
[0035] After step 409, the HTTP contents of the 
response are analyzed to check in step 410 whether the 
next page is designated. For example, the designation 
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of the next page is detected by searching for a specific 
character such as "Next* or button for designating the 
display of the next page. Note that the pres- 
ence/absence of the next page may be determined by 
any scheme as long as it is determined by analyzing the 
HTTP contents of the response. Some page indicates 
the first 20 search results of a total of 100 search results 
with a description like "1-20 of 100". With such a page, 
the presence of the next page can be determined by 
checking this description. If the next page is designated, 
the flow returns to step 404 to repeat acquisition 
processing of the page data of the URL of the next 
page. With regard to the next page, "2" is registered in 
the sequence number 502 of the URL table in Fig. 5A. 
This processing is repeated until no next page is 
present, thus setting the target URL table in Fig. 5A. In 
step 404 after step 403, a request containing the query 
condition input in step 403 is generated. In step 404 
after step 41 0, a request for requesting the page data of 
the next page is generated. 

[0036] If there is no next page at nesting level "1", 
the flow advances from step 410 to step 41 1 to check 
whether the nesting level has reached the designated 
level. If NO in step 41 1 , the processing from step 404 is 
repeated for the next nesting level. For example, after 
the URL table in Fig. 5A at nesting level "1" is set, the 
files indicated by the local file name 506 registered in 
the table are looked up one by one to sequentially 
extract the URLs contained in the local files, and the 
processing from step 404 is performed. Figs. 5B and 5C 
respectively show URL tables with nesting levels "2" 
and "3 s . Note that the nesting levels are written like "1- 
1" and "1-1-1". Assume that in step 404 after step 41 1 , 
a request for requesting the page data of the URL con- 
tained in the above local file is generated. If it is deter- 
mined in step 41 1 that the processing is done up to the 
designated nesting level, the search processing is ter- 
minated. 

[0037] With the above processing, all pieces of 
information matching the query condition are acquired 
in the target URL table (and the local files indicated by 
the local file name 506), as shown in Figs. 5A to 5C. 
Even if search results are displayed in a list extending in 
a plurality of pages, information is automatically 
acquired from ail the pages without any manual opera- 
tion. 

[0038] Fig. 6 shows a procedure for outputting 
retrieved information, tn step 601 , the information of dis- 
play target fields designated in the box 232 in Fig. 2 is 
acquired by looking up the files indicated by the local file 
name 506 in the transmission in Figs. 5A to 5C. Assume 
that in a patent search in the US Patent and Trademark 
Office, "No.", "Applicant", "Title", and "Abstract" are des- 
ignated as display targets. In this case, the pieces of 
information of these fields are extracted from the 
respective local files. In step 602, the extracted informa- 
tion is shaped to be displayed/output in the form shown 
in Fig. 3. 
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[0039J In the above embodiment, the window in Fig. 
2 and the output sample in Fig. 3 are examples and may 
be modified, as needed. In addition, in the query input 
area 202 on the window in Fig. 2, another query condn 
tion may be input outside the range of the query pro- 
vided on the home page as the search target. In this 
case, a search is conducted within the range of the 
query provided on the home page as the search target. 
When a response as a search result is received in step 
406 in Fig. 4, a search is made under the added query 
condition, and only information matching the query con- 
dition may be stored in the buffer. In the above embodi- 
ment, all pieces of information matching the query 
condition are stored in the transmission and local files. 
However, only the pieces of information of fields desig- 
nated by the display condition may be stored in local 
files. 

[0040] In the above embodiment a program for 
executing the processing in Fig. 4 or 6 can be imple- 
mented in various forms, e.g., being newly generated as 
a browser, implemented by using a CGI (Common 
Gateway Interface) function, and implemented as the 
plug-in of an existing browser. In addition, such a pro- 
gram can be implemented independently of an OS and, 
for example, can be implemented on Windows (trade- 
name in Microsoft Corporation)-based OS or UNIX- 
based OS. 

[0041] In the above embodiment, a pre-registered 
search site is designated as a search target, and a 
query corresponding to each search site is displayed in 
the area 202. However, the display in the area 202 may 
be dynamically changed in accordance with a search 
target arbitrarily input by the user. Assume that the URL 
of a search target to be accessed for the first time is 
input in the area 201 . In this case, a page of the SETUP 
message may be received, and a portion for inputting a 
query may be extracted from the page data, thus dis- 
playing it in the area 202. 

[0042] A practical procedure for automatically gen- 
erating the initial search window in Fig. 2 from the page 
data of a search target will be described in detail below. 
A procedure for the search page of the US Patent and 
Trademark Office will be described first A procedure for 
the search page of the Japanese Patent Office will be 
described later. 

[0043] Fig. 7 shows a window of the search page of 
the US Patent and Trademark Office, and more specifi- 
cally, an example of displaying the HTML document of 
the search page without any modification. Figs. 8 to 10 
show the HTML sources. A display 700 in Fig. 7 corre- 
sponds to HTML contents from a (form) tag 800 to a 
(/form) tag 890 of the HTML sources in Figs. 8 to 10. A 
display element 701 corresponds to the HTML source 
801; a display element 702, to the HTML source 802; a 
display element 703, to the HTML source 803; a display 
element 704, to the HTML source 804; a display ele- 
ment 705, to the HTML source 805; a display element 
706, to the HTML source 806; a display element 71 1 , to 



the HTML source 81 1 ; and a display element 71 2, to the 
HTML source 812. When the user accesses this search 
page, these HTML sources are sent A window like the 
one shown in Fig. 7 can be obtained by displaying the 

5 sources with a general browser. 

[0044] Instead of the window in Fig. 7, the initial 
search window in Fig. 2 is generated and output by ana- 
lyzing the HTML contents from the (form) tag 800 to the 
(/form) tag 890 of the HTML sources. In this case, the 

w contents from (form) to (/form) may be basically dis- 
played as the window 202 in Fig. 2. Note, however, that 
the button portions 711 and 712 in Fig. 7 are written in 
(INPUT) tags like the HTML sources 811 and 812. For 
this reason, the (INPUT) tags with TYPE = "SUBMIT, 

is "BUTTON", and "RESET are converted into the form of 
HTML comments. More specifically, for example, 
(INPUT ....) may be converted into (HINPUT ....>--). 
[0045] If a problem arises when the window is dis- 
played without any modification, only tags for inputting 

20 values may be extracted to display them while arranging 
them vertically. In this case, the names of input values 
are made to match the search page. In the case of the 
search page of the US Patent and Trademark Office, for 
example, the HTML contents of (INPUT) and (SELECT) 

25 to (/SELECT) are extracted to be displayed while 
arranged in correspondence with each other. This is a 
general technique that can be applied to other search 
pages. 

[0046] With regard to the display of the window 203 
30 in Fig. 2, values indicated by the (OPTION) to (/OPTION) 
items in the (SELECT) tag of the HTML source may be 
extracted to be displayed in the same manner as the 
box 231 in Fig. 2. 

[0047] Fig. 1 1 shows a window of the search page 
35 of the Japanese Patent Office, and more specifically, an 
example of displaying the HTML document of the 
search page without any modification. Figs. 12 to 15 
show the HTML sources. A display 1 1 00 in Fig. 1 1 cor- 
responds to HTML contents from a (form) tag 1200 to a 
40 (/form) tag 1 290 of the HTML sources in Figs. 1 2 to 1 5. 
A display element 1101 corresponds to the HTML 



source 
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display 


element 


1102, 


to 


the 


HTML 


source 


1202; 
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display 


element 


1103, 


to 


the 


HTML 


source 
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display 


element 


1104, 


to 


the 


HTML 


45 source 
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display 


element 


1105, 


to 


the 


HTML 


source 


1205; 
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display 


element 


1106, 


to 


the 


HTML 


source 


1206; 
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display 


element 


1107, 


to 


the 


HTML 


source 


1207; 
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display 


element 


1108, 


to 


the 


HTML 


source 


1208; 
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display 


element 


1109, 


to 


the 


HTML 


so source 


1209; 
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display 


element 


1110, 


to 


the 


HTML 


source 


1210; and a display element 1 1 11, to the HTML 



source 121 1 . When the user accesses the search page, 
these HTML sources are sent. A window like the one 
shown in Fig. 1 1 can be obtained by displaying the 
55 sources with a general browser. 

[0048] In the case of the Japanese Patent Office, 
since input information is dynamically generated by 
Javascript (the HTML source 1222 in Fig. 15), (SCRIPT) 
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to (/SCRIPT) statements in the contents from the (form) 
tag 1200 to the (/form) tag must be extracted. 
[0049] An initial search window like the one shown 
in Rg. 2 may be generated from the HTML sources in 
Figs. 12 to 15 as in the above case of the search page s 
of the US Patent and Trademark Office. With regard to 
the display of the display candidate list in the box 231 in 
Fig. 2, the values indicated by the (OPTION) items of the 
(SELECT) tags 1202, 1205, and 1208 may be displayed. 
Note that in the case of the Japanese Patent Office, 
(/OPTION) is omitted. 

[0050] The second embodiment of the present 
invention will be described next. In the first embodiment 
described above, downloaded pages are stored in a 
URL table like the one shown in Figs. 5A to 5C. In the 
second embodiment, such a table is not generated, and 
a downloaded result is stored in a predetermined target 
storage directory. A description of portions similar to 
those in the first embodiment will be omitted, and only 
points different from those in the first embodiment will 
be described below. 

[0051 ] Fig. 1 6 shows an example (replacing the one 
shown in Fig. 2) of an initial search window in the sec- 
ond embodiment. When one of several pre-registered 
search targets is selected, the window in Fig. 1 6 is dis- 
played. This window 1601 includes query character 
string input areas 1602 and 1603, designation areas 
1604 and 1605 for fields in which query character 
strings exist, a query condition input area 1606, a 
search year input area 1 607, and a target storage direc- 
tory designation area 1608. The query character string 
input areas 1 602 and 1603 correspond to the areas 221 
and 224 in Rg. 2. The designation areas 1 604 and 1 605 
for fields in which query character strings exist corre- 
spond to the areas 222 and 225 in Fig. 2. The query 
condition input area 1606 corresponds to the area 223 
in Fig. 2. The search year input area 1607 corresponds 
to the area 226 in Rg. 2. 

[0052] The target storage directory designation 
area 1608 is used to designate a directory for storing a 
file in which a search result is downloaded. A search 
start button 1 609 is a button for giving an instruction to 
start a search. An end button 1 61 0 is a button for giving 
an instruction to end a search (forcibly end a search). 
The user inputs query character strings and query con- 
ditions on the window in Rg. 1 6, and starts a search by 
clicking the search start button 1609. 
[0053] In this embodiment a search result is stored 
in the target storage directory designation area 1608 but 
is not displayed as shown in Fig. 3. The user knows the 
search result by accessing the downloaded file after- 
ward, as needed. Therefore, result output processing 
like the one shown in Rg. 6 is not required. 
[0054] The search procedure in this embodiment is 
the same as that in Fig. 4 except that steps 404 to 409 
in Rg. 4 are replaced with a call statement for the OL 
(DownLoad) subroutine in Fig. 1 7. The DL subroutine is 
a recursive call subroutine. 



12 

[0055] The procedure in the DL subroutine will be 
described. In step 1701, a given argument a is down- 
loaded. The downloaded data is stored as a file to which 
a predetermined name is attached in a target storage 
directory. When a call is made first from the processing 
in Fig. 4, a URL containing a request for a search under 
the input query condition is designated as the argument 
a. A list of items matching the input query condition is 
obtained first as a response. In the case of the search 
page of the US Patent and Trademark Office, for exam- 
ple, a list of patents matching the input query condition 
can be acquired in step 1 701 . 
[0056] In step 1702, it is checked whether the cur- 
rent processing has reached the last nesting level. Nest- 
ing levels may be fixed to predetermined values or 
arbitrarily set. Assume that in this case, the page 
acquired first in step 1701 is at nesting level 1 , and the 
processing is to be terminated at nesting level 2 set as 
a fixed value. 

[0057] Since the page acquired first in step 1701, 
i.e., a list of patents matching the query condition, is at 
nesting level 1 , the flow advances from step 1 702 to 
step 1703. In step 1703, it is checked whether any link 
is present in the acquired page. If YES in step 1 703, the 
DL subroutine is recursively called in step 1 704. In this 
case, a URL designating the link is transferred as an 
argument. 

[0058] In the DL subroutine activated by the recur- 
sive call, the contents of the URL of the link are down- 
loaded in step 1 701. The downloaded data is stored as 
a file with a predetermined name in a target storage 
directory. The nesting level is then checked in step 
1 702. In this case, since nesting level 2 is determined, 
the processing is terminated. If download operation up 
to a lower nesting level is designated, the processing in 
step 1 703 and the subsequent steps may be continued. 
[0059] If the flow returns after the recursive call in 
step 1704, it is checked in step 1705 whether the 
processing is completed up to the end of the currently 
processed page. If NO in step 1 705, the flow returns to 
step 1703 to continue the processing for the link in the 
page. If YES in 1 705, the processing is terminated. 
[0060] In the above manner, downloaded files are 
stored in the target storage directory. As a conse- 
quence, all patent files matching the query condition are 
stored in the target storage directory, and hence the 
user can use them, as needed. 
[0061] More specifically, when downloaded data is 
to be stored in the target storage directory, a directory 
named as "query character string 1 -query character 
string 2" is generated under the target storage directory, 
and the file of the downloaded data is stored under the 
directory. In the case of the query character strings in 
Fig. 16, a directory named as "electronic commercial 
transaction-internet" is generated, and a downloaded 
file is stored under the directory. If a query character 
string includes a character that cannot be used for a 
directory name, e.g., ",", this character is replaced with 
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■J (underline). 

[0062] In the first and second embodiments, 
processing is continued until data matching a query 
condition are downloaded. If, however, the user gives an 
instruction to cancel or terminate the processing during s 
the processing, the processing is forcibly terminated. 
Further, if there is provided a input area of a number of 
pages to be downloaded, the information acquisition is 
terminated when the page data of the number is down- 
loaded. 10 
[0063] As has been described above, according to 
the present invention, on a search site on the Internet, 
even if URLs matching a query condition extend in a 
plurality of pages, the next page is detected, and infor- 
mation is automatically acquired. Therefore, the con- 15 
tents of URLs matching the query can be collectively 
acquired automatically without any manual operation. In 
addition, since only the information of designated fields 
is displayed by inputting a display condition, the user 
need not perform any operation of extracting only nec- 20 
essary portions from the acquired information, but a 
search result containing only necessary information can 
be automatically obtained. 

[0064] According to its broadest aspect the inven- 
tion relates to a Web data acquisition apparatus for 25 
acquiring Web data of a page for displaying or retrieving 
information matching a query condition upon inputting 
the query condition on the Internet and Intranet, or an 
extranet, comprising: means for designating a page as a 
search target; means for inputting a query condition; 30 
and means for generating a request for giving an 
instruction to execute a search under the query condi- 
tion and transmitting the request to a server as a search 
target 

35 

SUMMARY OF THE INVENTION 
[0065] 

1 . A Web data acquisition apparatus for acquiring 40 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet an intranet or 
an extranet, comprising: 

45 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; 

means for generating a request for giving an 

instruction to execute a search under the query so 

condition, and transmitting the request to a 

server as a search target; 

means for receiving a response as a search 

result; 

storage means for storing contents of the ss 
response; 

means for detecting presence/absence of a 
next page from the contents; and 



means for, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page and storing the data in said storage 
means, and repeating the operation as long as 
a next page is detected. 

2. A Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet, an intranet, or 
an extranet, comprising: 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; 

means for generating a request for giving an 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target; 

means for receiving a response as a search 

result; 

storage means for downloading the linked page 
by tracing a link in the response, and storing 
the downloaded page data; 
means for detecting presence/absence of a 
next page from the response; and 
means for, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, downloading the linked page by 
tracing a link in the acquired page data, and 
storing the downloaded page data in said stor- 
age means, and repeating the operation as 
long as a next page is detected. 

3. A Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet, an intranet, or 
an extranet, comprising: 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; 

means for generating a request for giving an 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target 

means for receiving a response as a search 

result; 

means for detecting presence/absence of a 
next page from contents of the response; and 
means for, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, and repeating the operation as long 
as a next page is detected. 
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4. A Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet, an intranet, or 
an extranet, comprising: 5 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; 
means for generating a request for giving an 10 
instruction to execute a search under the query 
condition, and transmitting the request to a 
server as a search target; 
means for receiving a response as a search 
result; 15 
means for downloading the linked page by trac- 
ing a link in the response; 
means for detecting presence/absence of a 
next page from the response; and 
means for, if the next page is detected, per- 20 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, downloading the linked page by 
tracing a link in the acquired page data, and 
repeating the operation as long as a next page 25 
is detected. 

5. An apparatus 

wherein said apparatus further comprises 
means for designating a nesting level at which infor- 30 
mation is acquired, and acquires information by 
tracing a URL contained in the contents up to the 
nesting level. 

6. An apparatus 

wherein a search is made for information 35 
stored in said storage means under another query 
condition, and the research result is stored in said 
storage means. 

7. An apparatus 

wherein 40 

said apparatus further comprises means for 
inputting a display condition for determining 
information of a specific field of retrieved infor- 
mation, and 45 
the information of the field designated under 
the display condition is extracted from the infor- 
mation stored in said storage means and is dis- 
played. 

so 

8. An apparatus 

further comprising means for terminating 
information acquisition on the basis of a predeter- 
mined condition instead of continuing Information 
acquisition as long as a next page is detected. 55 

9. A Web data acquisition method of acquiring Web 
data of a page for displaying or retrieving informa- 
tion matching a query condition upon inputting the 



query condition on the Internet, an intranet, or an 
extranet, comprising: 

the step of designating a page as a search tar- 
get; 

the step of inputting a query condition; 

the step of generating a request for giving an 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target; 

the step of receiving a response as a search 

result; 

the storage step of storing contents of the 
response; 

the step of detecting presence/absence of a 
next page from the contents; and 
the step of, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page and storing the data in the storage 
step, and repeating the operation as long as a 
next page is detected. 

10. A Web data acquisition method of acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet, an intranet, or 
an extranet, comprising: 

the step of designating a page as a search tar- 
get; 

the step of inputting a query condition; 

the step of generating a request for giving an 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target; 

the step of receiving a response as a search 

result; 

the storage step of downloading the linked 
page by tracing a fink in the response, and stor- 
ing the downloaded page data; 
the step of detecting presence/absence of a 
next page from the response; and 
the step of, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, downloading the linked page by 
tracing a link in the acquired page data, and 
storing the downloaded page data in the stor- 
age step, and repeating the operation as long 
as a next page is detected. 

1 1 . A method 

wherein the method further comprises the 
step of designating a nesting level at which informa- 
tion is acquired, and information is acquired by trac- 
ing a URL contained in the contents up to the 
nesting level. 
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12. A method 

further comprising making a search for infor- 
mation stored as the file under another query con- 
dition, and storing the research result as a file. 

1 3. A method s 

wherein 

the method further comprises the step of input- 
ting a display condition for determining infor- 
mation of a specific field of retrieved w 
information, and 
- the information of the field designated under 
the display condition is extracted from the infor- 
mation stored as the file and is displayed. 

J 5 

14. A method 

further comprising terminating information acquisi- 
tion on the basis of a predetermined condition 
instead of continuing information acquisition as 
long as a next page is detected. 20 

15. A program storage medium storing a program 
for the Web data acquisition method. 

Claims 

25 

1. A Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet, an intranet, or 

an extranet, comprising: 30 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; 

means for generating a request for giving an 35 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target; 

means for receiving a response as a search 

result; 40 

storage means for storing contents of the 

response; 

means for detecting presence/absence of a 
next page from the contents; and 
means for, if the next page is detected, per- 45 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page and storing the data in said storage 
means, and repeating the operation as long as 
a next page is detected. so 

2. A Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 

the query condition on the Internet, an intranet, or 55 
an extranet, comprising; 

means for designating a page as a search tar- 



get; 

means for inputting a query condition; 

means for generating a request for giving an 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target; 

means for receiving a response as a search 

result; 

storage means for downloading the linked page 
by tracing a link in the response, and storing 
the downloaded page data; 
means for detecting presence/absence of a 
next page from the response; and 
means for, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, downloading the linked page by 
tracing a link in the acquired page data, and 
storing the downloaded page data in said stor- 
age means, and repeating the operation as 
long as a next page is detected. 

3. A Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet, an intranet, or 
an extranet, comprising: 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; 

means for generating a request for giving an 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target; 

means for receiving a response as a search 

result; 

means for detecting presence/absence of a 
next page from contents of the response; and 
means for, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, and repeating the operation as long 
as a next page is detected. 

4. A Web data acquisition apparatus for acquiring 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 
the query condition on the Internet, an intranet, or 
an extranet, comprising: 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; 
means for generating a request for giving an 
instruction to execute a search under the query 
condition, and transmitting the request to a 
server as a search target; 
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means for receiving a response as a search 
result; 

means for downloading the linked page by trac- 
ing a link in the response; 
means for detecting presence/absence of a 5 
next page from the response; and 
means for, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, downloading the linked page by w 7. 
tracing a link in the acquired page data, and 
repeating the operation as long as a next page 
is detected. 

5. An apparatus according to any of the preceding 15 
claims wherein said apparatus further comprises 
means for designating a nesting level at which infor- 
mation is acquired, and acquires information by 
tracing a URL contained in the contents up to the 
nesting level, 20 

and/or wherein preferably 

a search is made for information stored in said 
storage means under another query condition, 
and the research result is stored in said storage 25 
means, 

and/or wherein preferably 
said apparatus further comprises means for 
inputting a display condition for determining 
information of a specific field of retrieved infor- 30 
mation, and 

the information of the field designated under 
the display condition is extracted from the infor- 
mation stored in said storage means and is dis- 
played, 35 

and/or further preferably 
comprising means for terminating information 
acquisition on the basis of a predetermined 
condition instead of continuing information 8. 
acquisition as long as a next page is detected. 40 

6. A Web data acquisition method of acquiring Web 
data of a page for displaying or retrieving informa- 
tion matching a query condition upon inputting the 
query condition on the Internet, an intranet, or an as 
extranet, comprising: 

the step of designating a page as a search tar- 
get; 

the step of inputting a query condition; so 
the step of generating a request for giving an 
instruction to execute a search under the query 
condition, and transmitting the request to a 
server as a search target; 

the step of receiving a response as a search 55 
result; 

the storage step of storing contents of the 
response; 



the step of detecting presence/absence of a 
next page from the contents; and 
the step of, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page and storing the data in the storage 
step, and repeating the operation as long as a 
next page is detected. 

A Web data acquisition method of acquiring Web 
data of a page for displaying or retrieving informa- 
tion matching a query condition upon inputting the 
query condition on the internet, an intranet, or an 
extranet, comprising: 

the step of designating a page as a search tar- 
get; 

the step of inputting a query condition; 

the step of generating a request for giving an 

instruction to execute a search under the query 

condition, and transmitting the request to a 

server as a search target; 

the step of receiving a response as a search 

result; 

the storage step of downloading the linked 
page by tracing a link in the response, and stor- 
ing the downloaded page data; 
the step of detecting presence/absence of a 
next page from the response; and 
the step of, if the next page is detected, per- 
forming operation of acquiring data of the next 
page by transmitting a request to acquire the 
next page, downloading the linked page by 
tracing a link in the acquired page data, and 
storing the downloaded page data in the stor- 
age step, and repeating the operation as long 
as a next page is detected. 

A method according to any of the preceding claims 
wherein the method further comprises the step of 
designating a nesting level at which information is 
acquired, and information is acquired by tracing a 
URL contained in the contents up to the nesting 
level, 

and/or further preferably 

comprising making a search for information 
stored as the file under another query condi- 
tion, and storing the research result as a file, 
and/or wherein preferably 
the method further comprises the step of input- 
ting a display condition for determining infor- 
mation of a specific field of retrieved 
information, and 

the information of the field designated under 
the display condition is extracted from the infor- 
mation stored as the file and is displayed, 
and/or further preferably 
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comprising terminating information acquisition 
on the basis of a predetermined condition 
instead of continuing information acquisition as 
long as a next page is detected. 

5 

9. A program storage medium storing a program for 
the Web data acquisition method defined in of the 
preceding claims. 

10. A Web data acquisition apparatus for acquiring w 
Web data of a page for displaying or retrieving infor- 
mation matching a query condition upon inputting 

the query condition on the Internet, and Intranet, or 
an extranet, comprising: 

75 

means for designating a page as a search tar- 
get; 

means for inputting a query condition; and 
means for generating a request for giving an 
instruction to execute a search under the query 20 
condition and transmitting the request to a 
server as a search target. 
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FIG.7 



Help | | Home | | Boolean | | Manual | | Number [ [Order Copy] | PTDLs 



US Patent Full-Text Database Boolean Search 

Data current through 05/25/1999 



Query [Help] 
Term 1:1 



i 



700 



701 



T 



Term 2: 



/•704 



703 



in Field 1: All Fields 




T 


And T 


7o£ 




in Field 2: All Fields 




▼ | 



r\ \JU 7J2 
\ Reset / 



Select years fHelpl | 1998-1999 1 T 



706' 



T 



705 



Search ^ Reset 
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<HTML> 
<HEAD> 

<TITLE>US Patent Full-Text Database Boolean Search</TITLE> 
</HEAD> 

<BODY> BGCOLOR="#FFFFFF> 
<center> 

<!-<IMG border=0 src=/rietaicon/PTO/titlebar.gifxbr>— > 
<A HREF= ,, http://www.uspto.gov/patft/help.htm ,, > 
<IMG border=0 width=63 height=24 SRC=7netaicon/PTO/heip.girALT=' , [Help] ,, x/A> 
<A HREF="http^/www.uspto.gov/patft/index.htmi ,, > 

<IMG width=63 height=24 border=0 SRC=7netaicon/PTO^ome.girALT= n [Home]"></A> 
<A HREF=7netahtml/search-bool.htrnl"> 

<IMG border=0 width=63 height=24 SRC»7netaicon/PTO/booIean.gir ALT=*[ Boolean Search]*x/A> 
<A HREF«7netahtml/search-adv.htm' , > 

<IMG width=63 height=24 border=0 SRC=7netaicon/PTO/manual.gir ALT="[Manual Search]"x/A> 
<A HREF=**netahtml/srchnum.htm ,, > 

<IMG width=63 height=24 border=0 SRC=7netaicon/PTO/number.girALT= M (Number Search]"x/A> 
<A HREF= "http://www.uspto.gov/go/ptcs/"> 

<IMG width=63 height=24 border=0 SRC=7netaicon/PTO/copy.gif' ALT="[Order Copy]"x/A> 
<A HREF= rt http://www.uspto.gov/go/ptdIT> 

<IMG width=63 height=24 border=0 SRCa7netaicon/PTO/ptdl.gir ALT="[PTDLsr></A> 

<hl>US Patent Full-Text Database Boolean Search</hl> 

</center> 

<!-DATA-xH3 ALIGN="CENTER f *xFONT COLOR= ,, darkblue">Data current through 05/25/1999 

</fontx/h3> 

<form action=7netacgi/nph-Pa^se^ ,, method=' , GET'> " ~ — 800 

<INPUT TYPE="HIDDEN' t NAME="Sectr VALUE="PT02"> 

<INPUT TYPE»"HIDDEN" NAME^Sec^ VALUE«"HITOFF , > 

<INPUT TYPE»"HIDDEN" NAME»V VALUED 1"> 

<INPUT TYPE="HIDDEN n NAME="u" VALUE=7netahtml/search-bool.htmr> 
<INPUT TYPE="HIDDEN" NAME="r" VALUE="0"> 
<INPUT TYPE="HIDDEN W NAME=T VALUE="S"> 
<INPUT TYPE="HIDDEN" NAMEsT VALUE="50 W > 

<table> 

<td align=leftxPxb> Query <A HREF="http://www.uspto.gov/patft/helpbool.htm ,, >[Help]</A></b> 

<BRx/td> 

<trxtd align=right> 

<TDx/TD> 

<trxtd> 

<STRONG>Tcrm 1:</STR0NG> 

<INPUT TYPE="TEXT" N AME= TERM 1 * SIZE=20> " 801 

</tdxtd align=right> 

in <B>Field 1:</B> 

</tdxtd> 

<SELECT NAMEs" FIELD 1 "> 

<option selected value= wr >All Fields</option> \ 

<option values tt TTL <, >TitIe</option> 

<option valuea"ABST , '>Abstract</option> 

<option value= M ISD rt >Issue Date</option> \ 

<option value="WKU">Patent Number</option> \ 802 

<option values , 'APD ,, >Application Date</option> / 

<option valuea^APN'^Application Serial Number</option> 

<option va!ue="APT w > Application Type</option> 

<option vaIue="ASNM">Assignee Name</option> 

<option value="ASCr>Assignee City</option> 

<option value= n ASST>Assignee State</optton> j 

<option value-" ASCO">Assignee Country</option> J 
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<option vaIue="CIPC>Intemational Classification</option> 
<option valuc= B ORCL' t >Issucd US Classification</option> 
<option values'EXP^Primary Examincr</option> 
<option values" EXA"> Assistant Examiner</optton> 
<option values"INZZ">Inventor Name</option> 
<option value= H INCr>Invcntor City</option> 
<option value="INST>Inventor Statc</option> 
<option values"INCO">Inventor Country</option> 
<option values"GOVT">Government Intcrcst</option> 
<option values"LREP">Attorney or Agent</option> 
<option values"PCTA ,, >PCT Information</option> 
<option value="PRFR">Foreign Priority</option> 
<option values"REIS">Reissue Data</option> 
<option value="RLAP rt >Related US App. Data</option> 
<option values"UREF">US Referenccs</option> 
<option value="FPRF>Foreign References</option> 
<optlon valuc="ORPL">Other Rcferences</option> 
<option values" ACLM">aaim(s)</option> 
<option value="PPDB">Descriptioii/Specification</option> 

</SELECT> 
</td></tr><tr><td></tdxtd align=middlc> 
<SELECT NAME="col"> *\ 
<OPTION value="AND" SELECTED>AND</option> I 
<OPTION value=*OR">OR</option> V 803 

<OPTION valuc= ,, NOT">ANDNOT</option> J 
</select> J 
</td></tr><tr><td> 
<STRONG>Term 2:</STRONG> 

<INPUT TYPEaTEXT" NAME=TERM2" SIZE=20> 804 

</tdxtd align=right> 

in <B>Field 2: </B> 

</tdxtd> 

<SELECT NAMEs"FIELD2 w > 

<option selected values"">AH Fields</option> 
<option value="TTL">Title</option> 
<option value="ABST">Abstract</option> 
<option value="ISD*>Issue Date</optton> 
<option value= M WKU">Patent Number</option> 
<option values" APD">Application Date</option> 
<option values" APN">Application Serial Number</option> 
<option values" APT"> Application Type</option> 
<option values" ASNM">Assignee Namc</option> 
<option values"ASCT>Assignee City</option> 
<option value="ASST">Assignee State</option> 
<option values ,, ASCO">Assignee Country</option> 
<option value="CIPC">Intemational CIassification</option> 
<option values"ORCL">lssued US Classiftcation</option> 
<option value="EXP">Primary Examiner</option> 
<option values "EXA"> Assistant Examiner</option> 
<option vaIue="lNZZ">Inventor Name</option> 
<option value="INCr>Inventor City</option> 
<option values w INST ,, >Inventor State</option> 
<option value="INCO*>Inventor Country </option> 
<option values'GOVT'^ovemment Interest</option> 
<option vaIue="LREP">Attorney or Agent</option> 
<option value= rt PCTA">PCT Information</option> 
<option valucs"PRFR">F6reign Priority</option> 
<option values"REIS">Reissue Data</option> 
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fig.io 



<option value^RLAP^Related US App. Data</option> 
<option value="UREP>US Refcrenccs</option> 
<option value="FPRF">Forcign Rcfcrences</option> 
<option value="ORPL' , >Othcr Refercnces</option> )S05 
<option value="ACLM*>Clatm(s)</option> 
<option value= ,, PPDB' , >E>escription/Spccificaiion</option> 

</SELECT> 
</td></tr> 
<td></td> 
</table> 
<table> 
<trxtd> 

<b>Sclect years <A HREF="http://www.uspto.gov/patft/hclpdb.htm' , >(HelpJ</A></strong> 

</b> 

<S ELECT SIZE=1 NAME="d*> 

<OPTION VALUE==curr>1998-1999</option> "\ 
<OPTION VALUE=paiI>AII ycars</optton> * 
<OPTION VALUE=pt97>1997</option> 
<OPTION VALUE=pt96>1996</option> 
<OPTION VALUE^pt95>1995</option> 
<OPTION VALUE=pt94>1994</option> 
<OPTION VALUE=pt93>1993</option> 
<OPTION VALUE=pt92>1992</option> 
<OPTION VALUE=pt91>1991</option> 
<OPTION VALUE=pt90>1990</option> 

<OPTION VALUE=pt89>1989</option> V 806 

<OPTION VALUE=pt88>1988</option> 
<OPTION VALUE=pt87>1987</option> 
<OPTION VALUE=pt86>1986</option> 
<OPTION VALUE=pt85>1985</option> 
<OPTION VALUE=pt84>1984</option> 
<OPTION VALUE=pt83>1983</option> 
<OPTION VALUE=pt82>1982</option> 
<OPTION VALUE=pt81>198i</option> 
<OPTION VALUE=pt80>1980</option> 
<OPTION VALUE=pt79>1979</option> 
<OPTION VALUE=pt78>1978</option> 
<OPTION VALUE=pt77>1977</option> 
<OPTION VALUE=pt76>1976</option> 
</SELECT> 

&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp 
&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp 
&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp 

<AUGN="right ,, ><INPUT TYPE= "SUBMIT* VALUE^'Search'V 811 

&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp&nbsp 

<ALIGN= ,, righr VAUGN=' , MIDDLE"><INPUT TYPE="RESET> —812 

</tdx/td> 
</table> 

</FORM> 890 

</BODY> 
</HTML> 
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FIG. 11 



Publication Classification 
©Japanese Patent Laid-Open 



(Japanese Patent Laid-Open \ 
International Patent Laid-Open j \ 
International Patent Re-Laid-Open J * 

OJapanese Utility Model Laid-Open 

(Japanese Utility Model Laid-Open \ 
International Utility Model Laid-Open \ 
International Utility Model Re-Laid-Open J 
Utility Model / 

OJapanese Patent Publication 

(Japanese Patent Publication \ 
Registered Japanese Patent Publication ) 

OJapanese Utility Model Publication / 

/ Japanese Utility Model Publication \ ' 

\ Registered Japanese Utility Model Publication j 



1101 



Select Search Items 



Title Of Invention 



1102 

W1105 Query Keyword 



I Title Of Invention 



Title Of Invention 




NU08 M109 

Search Lines Are Subjected To AND Operator (Read HELP For Details) 

1111 



Execute Search 



uoo 
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FIG.12 



<HTML> 
<HEAD> 



<TITLE>PUBLICATION TEXT SEARCH (QUERY CONDITION SETTING) </TITLE> 
<SCRIPT LANGUAGE=JAVASCRIPT> 
var FLG="ON"; 
var ERR=" M ; 
function sub(inp) { 

if (FLG== M ON") { 
if (Check KeywordQ==true) { 
FLG="exe"; 
inpsubmitQ; 



function Check_Keyword() { 
if( CheckWordCount(document.FORMl.key^valueOl) == false ) { 
alert ("OMIT SEARCH LINE AND EXECUTE SEARCH") ; 
return false ; 
} 

if( CheckWordCount(document.FORMl.key_vaIue02) == false ) { 
alert ("OMIT SEARCH LINE AND EXECUTE SEARCH") ; 
return false ; 
} 

if( Check WordC6unt(document. FORM l.key_value03) »» false ) { 
alert ("OMIT SEARCH LINE AND EXECUTE SEARCH") ; 
return false ; 



function CheckWordCount( Object ){ 
var String ; 
String = Object. value ; 

if( navigator-appName m "Microsoft Internet Explorer" ) 
if( String.length > 100 ) return false ; 
return true ; 



</SCRIPT> 
</HEAD> 

<BODY TEXT="#000000" LINK="#0000fT VUNK="#ff00fP ALINK="#ff0000" 

BGCOLOR="#COCOCO" onLoad="if(ERR != ") 

aIert(ERR);"> 



<FORM NAME=F0RM1 ACTION=7journal/search.cgi" TARGET="main" 

method="POST"> 

<NOBR> 



} 



return true ; 
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FIG.13 



<B>PUBLICAT10N CLASSIFICATION</B><BR> 

<IN?UT TYPE= n radio n NAME="class" VALUE=*AA" CHECKED> JAPANESE PATENT LAID-OPEN \ 
( JAPANESE PATENT LAID-OPEN, INTERNATIONAL PATENT LAID-OPEN, INTERNATIONAL 
PATENT RE-LAI D-0 PEN ) <BR> 
<INPUT TYPE^radio" NAME="ctass" VALUE^UU^ JAPANESE UTILITY MODEL LAID-OPEN 
( JAPANESE UTILITY MODEL LAID-OPEN, INTERNATIONAL UTILITY MODEL LAID-OPEN, 
INTERNATIONAL UTILITY MODEL RE-LAID -OPEN. UTILITY MODEL ) <BR> 
<INPUT TYPE radio" NAME="class" VALUE= H BB-> JAPANESE PATENT PUBLICATION 

( JAPANESE PATENT PUBLICATION, REGISTERED JAPANESE PATENT PUBLICATION ) <BRs 
< INPUT TYPE^radio- NAME= w c1ass" VALUE^YY'S JAPANESE UTILITY MODEL PUBLICATION 
( JAPANESE UTILITY MODEL PUBLICATION, REGISTERED JAPANESE UTILITY MODEL 
PUBLICATION ) <BR> 



>1201 



<HR> 
<TABLE> 

<TR><TH>SELECT SEARCH ITEM<TH>QUERY KEYWORD <TH>QUERY 
<TRxTD> 

<SELECT NAME="s_itcmOr> 

<OPTION VALUE=n">TITLE OF INVENTION 

<OPTION VALUE="2">ABSTRACT 

<OPTION VALUE= M 3 W >CLAJMS 

<OPTION VALUE=" 4" > ABSTRACT + CLAIMS 

<OPTION VALUE="2r>IPC 

<OPTION VALUE= , '22">FI 

<OPTION VALUE= M 23">FACET IDENTIFIER 

<OPTION VALUE="24*>APPLICANT/ ASSIGNEE 

<OPTlON VALUE="25">APPUCANT IDENTIFICATION NUMBER 

<OPTION VALUE="26 H >INVENTOR 

<OPTION VALUE="27 M >AGENT 

<OPTION VALUE="28">EXAMINER 

<OPTION VALUE="29">APPUCATION NUMBER 

<OPTION VALUE="30">FILING DATE 

<OPTION VALUE="3r>LAID-OPEN NUMBER 

<OPTION VALUE='32">LAID-OPEN DATE 

<OPTlON VALUE="33->PUBUCAT10N NUMBER 

<OPTION VALUE="34">PUBLICATION DATE 

<OPTION VALUE="35 ,, >REGISTRATION NUMBER 

<OPTION VALUE= W 36"> REGISTRATION DATE 

<OPTION VALUE="37"> REGISTERED PUBUCATION ISSUE DATE 

<OPTION V ALUE="38" > INTERNATIONAL PATENT LAID-OPEN NUMBER 

<OPTION VALUE="39">lNTERNATIONAL PATENT LAID-OPEN DATE 

<OPTION VALUE="40 r, >INTERNATIONAL PATENT REISSUE DATE 

<OPTION VALUE="41">PUBLICATION ISSUE DATE 

<OPTION VALUE="42">TRIAL NUMBER 

<OPTlON VALUE3"43">INTERNATI0NAL APPLICATION NUMBER 

<OPTlON VALUE="44">INTERNAT10NAL FIUNG DATE 

<OPTlON VALUE=*45">INTERNAT10NAL LAID-OPEN NUMBER 

<OPTION VALUE='46"> INTERNATIONAL LAID-OPEN DATE 

<OPTION V ALUE="47"> DECLARAT10N-OF-PRIO RTTY COUNTRY NUMBER 

<OPTION VALUE='*48 ,, >DESIGNATED COUNTRY 

<OPTION VALUE="49">LAID-OPEN, INTERNATIONAL LAID-OPEN DATE 

</SELECT> 

<TD> 

<INPUT TYPE="texr NAME="key_value01* SIZ£-"40" MAXLENGTH=."200"> 
<TD> 



1202 



1203 
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FIG.14 



<SELECT NAME=* rt kjtemOr> 
<OPTION VALUED r>OR 



1204 



<OPTION VALUE="2">AND I 
</SELECT> J 
<TRxTD> 

<SELECT NAME="s item02"> 

<OPTION VALUE= B 1">TITLE OF INVENTION 

<OPTION VALUE="2">ABSTRACT 

<OPTION VALUE= M 3 M >CLAIMS 

<OPTION VALUE="4"> ABSTRACT + CLAIMS 

<OPTION VALUE="2r>IPC 

<OPTION VALUE= H 22 ,, >n 

<OPTION VALUE="23">FACET IDENTIFIER 

<OPTION VALUE»'*24 ,, > APPLICANT/ ASSIGNEE 

<OPTION VALUE="25 W >APPLICANT IDENTIFICATION NUMBER 

<OPTION VALUE="26">INVENTOR 

<OPTION VALUE="27 W >AGENT 

<OPTION VALUE="28">EXAMINER 

<OPTION VALUE="29">APPLICATION NUMBER 

<OPTION VALUE="30">APPUCATION DATE 

<OPTION VALUE="31">LAID-OPEN NUMBER 

<OPTION VALUE= B 32 ,, >LAID-OPEN DATE 

<OPTION VALUE="33">PUBUCATION NUMBER 

<OPTION VALUE="34*>PUBUCATION DATE 

<OPTION VALUE=s n 35' , >REGISTRATION NUMBER 

<OPTION VALUE="36">REGISTRATION DATE 

<OPTION VALUE="37 W >REGISTERED PUBUCATION ISSUE DATE 

<OPTION VALUE="38">INTERNATIONAL PATENT LAID-OPEN NUMBER 

<OPTION VALUE3"39 W >INTERNATI0NAL PATENT LAID-OPEN DATE 

<OPTION VALUE«"40">INTERNATIONAL PATENT REISSUE DATE 

<OFTION VALUE= B 41 H >PUBUCATION ISSU DATE 

<OPTION VALUE="42">TRIAL NUMBER 

<OPTION VALUE=" 43">INTERNATIONAL APPUCATION NUMBER 

<OPTION VALUE=s"44 n > INTERNATIONAL FILING DATE 

<OPTION VALUE= H 45">INTERNATIONAL IjUD-OPEN NUMBER 

<OPTION VALUE="46">INTERNATIONAL LAID-OPEN DATE 

<OPTION VALUE= n 47' , >DECLARATION-OF-PRIOIUTY COUNTRY NUMBER 

<OPTION VALUE="48">DESIGNATED COUNTRY 

<OPTION VALUE="49">LAID-OPEN, INTERNATIONAL LAID-OPEN DATE 

</SELECT> 

<TD> 

<INPUT TYPE="texf NAME^key.valueCE" SIZE-'W MAXLENGTH=*'200 ,, > 
<TD> 

<SELECT NAME="k_item02 ,, > ^ 



<OPTION VALUE="2">AND I 
</SELECT> ' 
<TR><TD> 

<SELECT NAME="s_item03"> A 
<OPTION V ALU E=" 1 " >TTTLE OF INVENTION >1208 
<OPTION VALUE="2">ABSTRACT J 
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FIG.15 



<OPTION VALUE^^CLAIMS 
<OPTION VALUE= "4">ABSTRACT + CLAIMS 
<OPTION VALUE= B 2r>IPC 
<OPTION VALUE= M 22 w >n 
<OPTION VALUE="23">FACET IDENTIFIER 
<OPTION VALUE-'74*>APPUCANT/ ASSIGNEE 
<OPTION VALUE3»25">APPLICANT IDENTIFICATION NUMBER 
<OPTION VALUE-^^INVENTOR 
<OPTION VALUE="27">AGENT 
<OPTION VALUE- W 28 W >EXAMINER 
<OPTION VALUE="29">APPUCATION NUMBER 
<OPTION VALUE= N 30">FIUNG DATE 
<OPTION VALUE*"3r>LAID-OPEN NUMBER 
<OPTION VALUE»"32 ff >LAID-OPEN DATE 
<OPTION VALUEs^^PUBLlCATlON NUMBER 

<OFTION VALUE-"34 tt >PUBUCATION DATE > 1208 

<OFTlON VALUE= t1 35 M >R£GlSTRATION NUMBER 
<OnTON VALUE="36 fl >REGlSTRATION DATE 
<OPTION VALUE="37">REGISTERED PUBUCATION ISSUE DATE 
<OPTION VALUE«*38">INTERNATIONAL PATENT LAID-OPEN NUMBER 
<OPTION VALUE="39">INTERNATIONAL PATENT LAID-OPEN DATE 
<OPTION VALUE= W 40">INTERNATIONAL PATENT REISSUE DATE 
<OPTION VALUE=*4r>PUBLICATlON ISSUE DATE 
<OPTION VALUE= M 42 ,, >TRIAL NUMBER 

<OPTION V ALUE= * , 43" >INTERN ATIONAL APPLICATION NUMBER 
<OPTION VALUE= n 44">INTERNATIONAL FILING DATE 
<OPTION VALUE= "45 *> INTERNATIONAL LAID-OPEN NUMBER 
<OPTION VALUE« M 46 ^INTERNATIONAL LAID-OPEN DATE 
<OPTION VALUEa"47">DECLARATION-OF-PRIORITY COUNTRY NUMBER 
<OPTION VALUE= "48" > DESIGNATED COUNTRY 

<OPTION VALUE="49' , >LAID-OPEN t INTERNATIONAL LAID-OPEN DATE 
</SELECT> 
<TD> 

<INPUT TYPE= "text" NAME="key_valueQ3" SIZE=* H 40" MAXLENGTH=» H 200"> 1209 

<TD> 

<SELECT NAME="k_item03"> "| 
<OPTION VALUEs* 1">0R U210 
<OPTION VALUE="2">AND I 
</SELECT> ' 
</TABLE> 
<P> 

SEARCH LINES ARE SUBJECTED TO AND OPERATOR ( READ HELP FOR DETAILS ) 
<BR> 

<INPUT TYPE="buttoiT VALUE*" EXECUTE " onaick^subOhis-form);^ - 1211 

<INPUT TYPE^hidden" NAM E=" term exp" VALUE=M" CHECKED> 

<INPUT TYPE="hidderT NAME="end M VALUE="END W CHECKED> 

<SCRIPT LANGUAGE=JAVASCRIPT> 
A=ncw DatcO; V 1222 
documcnt-writ^^INPUT TYPE=*"hidden¥ w NAM E=V tt END2¥ w VALUE=¥""+A.gctTimc()+"¥ rt 

CHECKED>") 

</SCRlPT> 

</FORM> 1290 

</NOBR> 
</BODY> 



j 



-</HTML> 
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FIG.16 
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Electronic 
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Commerce 







Query Character String2 



1604 



internet 



Query Start 
End 



-1609 
-1610 



Query Condition Search Years 



s 1605 



Option 



And 



Iast2years 



607 



Target Storage Directory 



C:¥USPTO Search 
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FIG.17 
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